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1 : Immunology 1993 Jan;78(l):65-73 



Related Articles, Books, 



Characterization of T-cell responses to the house dust mite aller 
Der p II in mice. Evidence for major and cryptic epitopes. 

Hoyne GF, Callow MG, Kuo MC, Thomas WR 

Western Australian Research Institute for Child Health, Princess Margaret 
Hospital for Children, Perth. 

Major histocompatibility complex (MHC) congenic strains can be defined a 
and low responders to the major house dust mite allergen Der p II on the ba 
the ability to sensitize T cells for in vitro lymphokine release. Mice of the H 
haplotype were high responders, H-2k were intermediate and H-2d low 
responders. Like responses to other proteins, only a limited number of epito 
could be located by the response of T cells from mice immunized with aller 
a series of overlapping peptides. The epitopes for H-2b mice were 1 1-35, 78 
and 105-129, 36-50 and 78-104 for H-2k mice and 36-60 for H-2d. Immuni 
with the peptides however revealed that spleen-adherent cells were required 
lymph node cells to recall responses to the whole protein and in addition tha 
could be sensitized by cryptic epitopes defined by peptides 22-50 and 1-20 f 
2b mice. Peptides containing these cryptic epitopes did not normally induce 
responses in mice primed with the allergen, but when they were used for 
immunizing they could prime mice for responses to the peptide and the who 
allergen. The results both help to define a model for studying the presentatio 
allergens and have significant implications for peptide-based immunotherap 

PMID: 7679663, UI: 93170864 
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Sequence 79 from patent US 5776761 



Views: 
GenBank view 
FASTA view 
ASN.1 view 



Accession: ARO 16640 

Total Bases Sequenced: 1368 bp 

Completed: Dec 5, 1998. 

US PATENT: 5776761 
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Unknown. Unclassified. 



Nucleic acids encoding allergenic proteins from ragweed 

Rogers,B., Klapper,D.G., Rafhar,T. and Kuo,M. 
Patent: US 5776761-A 79 07-JUL-1998; 



Comments and suggestions to: [info(a),ncblnlm.nih.zov] 
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1 : ARO 16640 . Sequence 79 from p...[gi:3972917] 



Related Sequences 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



AR016640 1368 bp DNA PAT 

Sequence 79 from patent US 5776761. 
AR016640 

AR016640.1 GI:3972917 



Unknown . 

Unknown . 

Unclassified . 

1 (bases 1 to 1368) 

Rogers, B., Klapper, D. G. , Rafnar,T. and Kuo,M. 

Nucleic acids encoding allergenic proteins from ragweed 

Patent: US 5776761-A 79 07-JUL-1998; 

Location/Qualifiers 

1. .1368 

/organism= "unknown " 
a 286 c 327 g 



05-DEC-1998 



397 



358 t 



BASE COUNT 
ORIGIN 

1 ttgtatttta ccttagcact tgtcactttg gtgcaagctg gacgtcttgg cgaagaggtc 

61 gacatcttac cttcacctaa cgatacaagg aggagcctgc aaggatgtga agcacacaac 

121 attatagaca agtgttggag gtgcaaaccc gattgggcgg agaaccgaca agcgttaggc 

181 gattgtgcgc aaggttttgg aaaggcaact cacggcggaa aatggggtga tatctacatg 

241 gtcacaagtg atcaggatga tgatgttgta aatccaaaag aaggcacact ccggttcggt 

301 gctacccagg acaggccctt gtggatcatt tttcaaagag atatgattat ttatttgcaa 

361 caagagatgg tcgtaaccag cgacacgacc attgatggtc gaggggcgaa agttgagctc 

421 gtttatggag gtatcaccct catgaatgtc aagaatgtaa tcattcacaa catagatatc 

481 catgatgtta gagtgcttcc aggaggtagg attaagtcca atggtggtcc agccatacca 

541 agacatcaga gtgatggtga tgctatccat gttacgggta gttcagacat atggatcgac 

601 cattgcacgc tcagtaagtc atttgatggg ctcgtcgatg tcaactgggg cagcacagga 

661 gtaaccattt ccaactgcaa attcacccac cacgaaaaag ctgttttgct cggggctagt 

721 gacacgcatt ttcaagatct gaaaatgcat gtaacgcttg catacaacat cttcaccaat 

781 accgttcacg aaagaatgcc cagatgccga tttgggtttt tccaaatcgt taacaacttc 

841 tacgacagat gggataagta cgccatcggt ggtagctcga accctactat tctcagccaa 

901 gggaacaaat tcgtggcccc cgatttcatt tacaagaaaa acgtctgtct aaggactggt 

961 gcacaggagc cagaatggat gacttggaac tggagaacac aaaacgacgt gcttgaaaat 

1021 ggtgctatct ttgtggcatc tgggtctgat ccagtgctaa ccgctgaaca aaatgcaggc 

1081 atgatgcaag ctgaaccggg agatatggtt ccacaactca ccatgaatgc aggtgtactc 

1141 acatgctcgc ctggagcacc ttgctaagca cctggccaat tcctatgcaa cgatcataaa 

1201 tacttgctca ccataagtgt tcatttgatt agatttggac acgaatgatg taaccgattc 

1261 gtctgaatta tgatttgttt tgattctcag tttcataata tggcttcttg agagcaaaat 
1321 tagagaagag tgtctttgat caactacatt ttatggtttt tatattaa 

// 
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BLAST PubMed Nucleotide Protein Genome Structure PopSet Taxonomy Help 

Ambrosia artemisiifolia allergen Amb a VI (Amb-a-VI) mRNA, 
complete cds 



Views: 
GenBank view 
FASTA view 
ASN.1 view 
Coding Regions 
Feature table 



Accession: U89793 

Total Bases Sequenced: 357 bp 

Completed: Apr 1, 1997. 



1 i 1 1 1 1 I 



CDS with gene 
and mRNA 

gene, tRNA, 
promoter... 

Other features 
1 1 11 I '' 1 1 ' 11 11 I 3 57 



Legend: 
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I I - 100-208- 



^flmb-a-VI 



Organism: Ambrosia artemisiifolia 
Genetic Code: \ 

Lineage: Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 
euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; Asteridae; 
euasterids II; Asterales; Asteraceae; Asteroideae; Heliantheae; Ambrosia. 



Cloning and Expression of Ragweed Allergen Amb a VI 

Hiller,K.M, Lubahn,B.C. and Klapper,D.G. 
Unpublished 

Direct Submission 
Hiller,K.M. and Lubahn,B.C. 

Submitted (17-FEB-1997) Micro & Immuno, UNC-CH, CB#7290 631 FLOB, Chapel Hill, 
NC 27599, USA 



Comments and suggestions to: [infotyncbi.nlm.nihgov] 



http://wvw.ncbi.nlm.nih.gov/^ 



5/9/2000 



r,nucz-iN ucieouuc 



rage i ot 2 




Genome Structure 



Search l Nucleotiae ]l for 



PopSet 



Limits 



GenBank 



Preview/Index 

IS 



History 



Clipboard 



1 : U89793 . Ambrosia artemisii...[gi: 191 6291] 



Protein 



LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



gene 



CDS 



BASE COUNT 
ORIGIN 

1 
61 
121 
181 
241 
301 



AAU89793 357 bp mRNA PLN 01-APR-1997 

Ambrosia artemisiif olia allergen Amb a VI (Amb-a-VI) mRNA, complet 
cds . 
U89793 

U89793.1 GI: 1916291 

common ragweed. 
Ambrosia artemisiif olia 

Eukaryot a ; Viridiplant ae ; S t rept ophyt a ; Embryophyt a ; Tracheophyt a ; 
euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core 
eudicots; Asteridae; euasterids II; Asterales; Asteraceae; 
Asteroideae; Heliantheae; Ambrosia. 

1 (bases 1 to 357) 

Hiller, K.M. , Lubahn,B.C. and Klapper,D.G. 

Cloning and Expression of Ragweed Allergen Amb a VI 

Unpublished 

2 (bases 1 to 357) 
Hiller, K.M. and Lubahn,B.C. 
Direct Submission 

Submitted ( 17-FEB-1997 ) Micro & Immuno, UNC-CH, CB#7290 631 FLOB, 
Chapel Hill, NC 27599, USA 

Location/Qualifiers 

1. .357 

/organism="Ambrosia artemisiif olia" 
/db_xref="taxon: 4 212" 
/tissue_type=" flower" 
1. .357 

/gene="Amb-a-VI" 
1. .357 

/gene="Amb-a-VI" 

/note="also called Ra6; 9.9 kDa" 
/codon_start=l 

/product="allergen Amb a VI" 
/protein_id^" AAB51146.1 " 
/db_xref="GI : 1916292" 

/translation="MDCIRILWSVAVGLLLVSWRPTMFAASPTCDTVQNILAPCAGF 
TGQEPSKACCTGVNNLNNSRKTKADRVAVCNCIKELTKSIAYDPKRMPLLSTKCGVK 
DFPAVDKNLDCSKLPV" 
100 a 81 c 81 g 95 t 



// 



atggattgta ttcgcatact gtggtccgta gcagttggtt tgcttttggt ttcatggagg 
ccaacgatgt tcgcagcatc gccaacatgt gatactgtcc aaaacatatt ggcaccatgc 
gcaggtttct taacgggtca ggagccaagc aaggcgtgct gcacaggagt caacaatctt 
aataactcga gaaaaaccaa agctgatcgt gtggccgtct gcaactgtat caaagaattg 
acaaaatcga ttgcttacga tccaaaacgt atgcctcttc tgtctacgaa atgtggcgtt 
aaaccagatt ttcctgccgt cgataagaac ctcgactgtt caaaacttcc agtatga 
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BLAST PubMed Nucleotide Protein Genome Structure PopSet Taxonomy Help 

major fecal allergen Der pi- house-dust mite (Dermatophagoides 
pteronyssinus) (fragments) 



Views: 
GenBank view 
FASTA view 
ASN.1 view 
Feature table 



Total Residues Sequenced: 94 aa 
Completed: Mar 5, 1995. 



1 



Legend: 

- prote i n 



CDS with gene 
and mRNA 

gene, tRNA, 
promoter... 

Other features 



94 



*ma jor f eca I a I I er* 



1 - 0-100-.. . 



Organism: Dermatophagoides pteronyssinus 
Genetic Code: i 

Lineage: Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; Acariformes; 
Sarcopti formes; Astigmata; Analgoidea; Pyroglyphidae; Dermatophagoides. 



Structural studies on the allergen Der pi from the house dust mite Dermatophagoides 
pteronyssinus: similarity with cysteine proteinases 

Simpson,R.J., Nice,E.C, Moritz,R.L. and Stewart,G.A. 
Protein Seq. Data Anal. 2 (1), 17-21 (1989) 
89098855 



Comments and suggestions to: [infofyncbi.nlm.nih.govl 



http://www.ncbi.nlm.nih.gov 



5/9/2000 



Entrez-Protein 



Page 1 ot*2 



"3 NCBI 



PubMed 



• •• 



Nucleotide 



Search 1 ™" 11 for 



Protein 




GenPept 



Limits Preview/Index 



"Protein 



PopSet 



History 



Clipboard 



1 : S03380 . major fecal allerg...[gi: 1078971] 



PubMed, Related Sequences 



LOCUS S03380 94 aa INV 01-SEP-1995 

DEFINITION major fecal allergen Der pi- house-dust mite ( Dermatophagoides 

pteronyssinus) (fragments) . 
ACCESSION S03380 
PID gl078971 
VERSION S03380 GI: 1078971 

DBSOURCE pir: locus S03380; 

summary: #length 94 #checksum 3067; 

PIR dates: 05-Mar-1995 #sequence_revision Ol-Sep-1995 #text_change 
Ol-Sep-1995; 

punctuation in sequence. 

KEYWORDS 

SOURCE Dermatophagoides pteronyssinus. 

ORGANISM Dermatophagoides pteronyssinus 

Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; 
Acariformes; Sarcoptif ormes; Astigmata; Analgoidea; Pyroglyphidae; 
Dermatophagoides. 
1 (residues 1 to 94) 

Simpson, R. J. , Nice,E.C, Moritz,R.L. and Stewart, G. A, 
Structural studies on the allergen Der pi from the house dust mite 
Dermatophagoides pteronyssinus: similarity with cysteine 
proteinases 

Protein Seq. Data Anal. 2 (1), 17-21 (1989) 
89098855 

Location/Qualifiers 
1..94 

/organism="Dermatophagoides pteronyssinus" 
/db_xref="taxon: 6956" 
Protein 1 . . 94 

/product="major fecal allergen Der pi" 
/note="allergen Der pi" 

ORIGIN 



REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
FEATURES 

source 



// 



1 tnacsingna paeidlrqmr tvtpirmqmq ggcgsxxafs gvagieyiqh ngvvqesyyr 
61 fgisnycqiy ppnankdngy qpnyxavniv gyxn 



Revised: January 10, 2000. 
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BLAST PubMed Nucleotide Protein Genome Structure PopSet Taxonomy Help 

Dermatophagoides farinae mRNA for mite allergen Der f II 
precursor, partial cds, clone: pFLll 



Views: 
GenBank view 
FASTA view 
ASN.1 view 
Coding Regions 
Feature table 



Accession: D 10449 

Total Bases Sequenced: 485 bp 

Completed: Apr 28, 1993. 



CDS with gene 
and mRNA 

gene, tRNA, 
promoter... 



Other features 
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485 



"" """ m i te a I lergen Der f II pre 

po I yfl_s i gna I ■ 



Legend: 

— - CDS — — - other feature 
I 1 ' ' 1 I - 109-200-... 



Organism: Dermatophagoides farinae 
Genetic Code: i 

Lineage: Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; Acariformes; 
Sarcoptiformes; Astigmata; Analgoidea; Pyroglyphidae; Dermatophagoides. 



Cloning and expression of cDNA coding for the major house dust mite allergen Der f II 
in Escherichia coli 

Yuuki,T., Okumura,Y., Ando,T., Yamakawa,H., Suko,M, Haida,M. and Okudaira,H. 

Agric. Biol. Chem. 55 (5), 1233-1238 (1991) 

91291341 

Direct Submission 

Yuuki,T. 

Submitted (31 -JAN- 1992) to the DDBJ/EMBL/GenBank databases. Toshifumi Yuuki, Asahi 
Breweries, Ltd., Central Research Laboratories; 2-13-1, Ohmori-kita, Ohta-ku, Tokyo 143, 
Japan (Tel:03-5493-3255, Fax:03-5493-7027) 

Comments and suggestions to: [infotajnebi. nlm.nih.gov] 
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Save 
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History 
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1 : D 10449 . Dermatophagoides f...[gi:217307] 



PubMed, Protein, Related Sequences 



LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



mat_peptide 



DEPDER3 485 bp mRNA INV 01-FEB-2000 

Dermatophagoides farinae mRNA for mite allergen Der f II precursor 
partial cds, clone:pFLll. 
D10449 

D10449.1 GI:217307 

Der II major allergen group; Der f II; mite allergen. 
Dermatophagoides farinae cDNA to mRNA, clone rpFLll. 
Dermatophagoides farinae 

Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; 
Acariformes; Sarcoptif ormes ; Astigmata; Analgoidea; Pyroglyphidae; 
Dermatophagoides. 

1 (bases 1 to 485) 

Yuuki,T., Okumura,Y., Ando,T., Yamakawa,H., Suko,M., Haida,M. and 
Okudaira, H . 

Cloning and expression of cDNA coding for the major house dust mit 
allergen Der f II in Escherichia coli 
Agric. Biol. Chem. 55 (5), 1233-1238 (1991) 
91291341 

2 (bases 1 to 485) 
Yuuki, T . 

Direct Submission 

Submitted ( 31- JAN-1992 ) to the DDBJ/EMBL/GenBank databases. 
Toshifumi Yuuki, Asahi Breweries, Ltd., Central Research 
Laboratories; 2-13-1, Ohmori-kita, Ohta-ku, Tokyo 143, Japan 
(Tel: 03-54 93-3255, Fax:03-54 93-7027) 

Location/Qualif iers 

1. .485 

/organism=" Dermatophagoides farinae" 

/db_xref="taxon: 6954" 

/clone="pFLll" 

<1. .419 

/codon_start=3 

/product="mite allergen Der f II precursor" 
/protein_id=" BAA0124 1 .1 " 
/db_xref = "GI : 217308 " 

/translation="SLLVAAWADQVDVKDCANNEIKKVMVDGCHGSDPCIIHRGKP 
TLEALFDANQNTKTAKIEIKASLDGLEIDVPGIDTNACHFMKCPLVKGQQYDAKYTW 
VPKI APKSENVVVT VKLVGDNGVLACAI ATHAKI RD " 
30. .416 

/ evidence=experimental 
/product="Der f II" 
467. .472 
485 

94 c 87 g 138 t 



polyA_siqnal 

polyA_site 
BASE COUNT 166 a 

ORIGIN 

1 tttcattgtt ggtagcagcc gttgttgccg atcaagtcga tgttaaagat tgtgccaaca 
61 atgaaatcaa aaaagtaatg gtcgatggtt gccatggttc tgatccatgc atcatccatc 
121 gtggtaaacc attcactttg gaagccttat tcgatgccaa ccaaaacact aaaaccgcta 
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181 aaattgaaat caaagccagc ctcgatggtc ttgaaattga tgttcccggt atcgatacca 

241 atgcttgcca ttttatgaaa tgtccattgg ttaaaggtca acaatatgat gccaaatata 

301 catggaatgt gccgaaaatt gcaccaaaat ctgaaaacgt tgtcgttaca gtcaaacttg 

361 ttggtgataa tggtgttttg gcttgcgcta ttgctaccca cgctaaaatc cgtgattaaa 

421 aaaaaaaaat aaatatgaaa attttcacca acatcgaaca aaattcaata accaaaattt 

481 gaatc 



// 




GenBank 
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/ BLAST PubMed Nucleotide Protein Genome Structure PopSet Taxonomy Help 

Human gene for muscarinic acetylcholine receptor HM4 



Views: 
GenBank view 
FASTA view 
ASN.1 view 
Coding Regions 
Feature table 



Accession: XI 5266 

Total Bases Sequenced: 1913 bp 

Completed: Feb 19, 1990. 



1 



-i 1 i 



CDS with gene 
and mRNA 

gene, tRNA, 
promoter... 

Other features 

-i 1 1 1 1 r— 



Legend: 

— - CDS 



-i J 1 1 r- 



1 - 1000-2000- 



1913 



*HM4 muscarinic rec 



Organism: Homo sapiens 
Genetic Code: i 

Lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; 
Primates; Catarrhini; Hominidae; Homo. 



Distinct primary structures, ligand-binding properties and tissue-specific expression of 
four human muscarinic acetylcholine receptors 

Peralta,E.G., Ashkenazi,A., Winslow,J.W., Smith,D.H., Ramachandran,J. and Capon,D J. 

EMBO J. 6 (13), 3923-3929 (1987) 

88166632 

COMMENT See X 1 5263- 1 5266 for other human muscarinic acetylcholine receptor genes. 
Comments and suggestions to: finfo(q) y ncbi.nlm.n ih.govJ 
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1 : X15266 . Human gene for mus...[gi:32323] 



PubMed, Protein, Related Sequences, LinkOut 



REFERENCE 
AUTHORS 

TITLE 



JOURNAL 
MEDLINE 
COMMENT 

FEATURES 

source 



LOCUS HSHM4 1913 bp DNA PRI 14-JAN-2000 

DEFINITION Human gene for muscarinic acetylcholine receptor HM4 . 
ACCESSION X15266 X13530 
VERSION X15266.1 GI:32323 

KEYWORDS muscarinic acetylcholine receptor. 
SOURCE human. 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 
Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1913) 

Peralta, E.G. , Ashkenazi, A. , Winslow, J.W. , Smith, D.H. , 
Ramachandran, J. and Capon, D.J. 

Distinct primary structures, ligand-binding properties and 
tissue-specific expression of four human muscarinic acetylcholine 
receptors 

EMBO J. 6 (13), 3923-3929 (1987) 
88166632 

See X15263-15266 for other human muscarinic acetylcholine receptor 
genes . 

Location/Qualifiers 
1. .1913 
/organism="Homo sapiens" 
/db_xref="taxon: 9606" 
/clone_lib="human genomic library" 
CDS 101. .1873 

/note="HM4 muscarinic receptor (AA 1-590)" 

/codon_start-l 

/protein_id=" CAA33337 . 1 " 

/db_xref="GI: 32324" 

/db_xref="SWISS-PROT:P20309" 

/translation="MTLHNNSTTSPLFPNISSSWIHSPSDAGLPPGTVTHFGSYNVS 
AAGNFSSPDGTTDDPLGGHTVWQVVFIAFLTGILALVTIIGNILVIVSFKVNKQLKT 
NNYFLLSLACADLIIGVISMNLFTTYIIMNRWALGNLACDLWLAIDYVASNASVMNL 
VI S FDRYFS I TRPLTYRAKRTTKRAGVMI GLAWVI S FVLWAPAI LFWQYFVGKRT VP 
GECFIQFLSEPTITFGTAIAAFYMPVTIMTILYWRIYKETEKRTKELAGLQASGTEA 
TENFVHPTGSSRSCSSYELQQQSMKRSNRRKYGRCHFWFTTKSWKPSSEQMDQDHSS 
DSWNNNDAAASLENSASSDEEDIGSETRAI YSIVLKLPGHSTILNSTKLPSSDNLQV 
EEELGMVDLERKADKLQAQKSVDDGGSFPKSFSKLPIQLESAVDTAKTSDVNSSVGK 
TATLPLS FKEATLAKRFALKTRSQI TKRKRMSLVKEKKAAQTLSAI LLAFI I TWT PY 
IMVLVNTFCDSCIPKT FWNLGYWLCYINSTVNPVCYALCNKTFRTTFKMLLLCQCDK 
KRRKQQYQQRQSVI FHKRAPEQAL " 

BASE COUNT 496 a 529 c 444 g 444 t 

ORIGIN 

1 tcttttaacg tatgtaatgc aaagaacaaa caaataaagg cagaaatttt tctaactctg 
61 tctcttctct ctttccccca gactatgtca gagagtcaca atgaccttgc acaataacag 
121 tacaacctcg cctttgtttc caaacatcag ctcctcctgg atacacagcc cctccgatgc 
181 agggctgccc ccgggaaccg tcactcattt cggcagctac aatgtttctc gagcagctgg 
241 caatttctcc tctccagacg gtaccaccga tgaccctctg ggaggtcata ccgtctggca 
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301 agtggtcttc atcgctttct taacgggcat cctggccttg gtgaccatca tcggcaacat 
361 cctggtaatt gtgtcattta aggtcaacaa gcagctgaag acggtcaaca actacttcct 
421 cttaagcctg gcctgtgccg atctgattat cggggtcatt tcaatgaatc tgtttacgac 
481 ctacatcatc atgaatcgat gggccttagg gaacttggcc tgtgacctct ggcttgccat 
541 tgactacgta gccagcaatg cctctgttat gaatcttctg gtcatcagct ttgacagata 
601 cttttccatc acgaggccgc tcacgtaccg agccaaacga acaacaaaga gagccggtgt 
661 gatgatcggt ctggcttggg tcatctcctt tgtcctttgg gctcctgcca tcttgttctg 
721 gcaatacttt gttggaaaga gaactgtgcc tccgggagag tgcttcattc agttcctcag 
781 tgagcccacc attacttttg gcacagccat cgctgctttt tatatgcctg tcaccattat 
841 gactatttta tactggagga tctataagga aactgaaaag cgtaccaaag agcttgctgg 
901 cctgcaagcc tctgggacag aggcagagac agaaaacttt gtccacccca cgggcagttc 
961 tcgaagctgc agcagttacg aacttcaaca gcaaagcatg aaacgctcca acaggaggaa 
1021 gtatggccgc tgccacttct ggttcacaac caagagctgg aaacccagct ccgagcagat 
1081 ggaccaagac cacagcagca gtgacagttg gaacaacaat gatgctgctg cctccctgga 
1141 gaactccgcc tcctccgacg aggaggacat tggctccgag acgagagcca tctactccat 
1201 cgtgctcaag cttccgggtc acagcaccat cctcaactcc accaagttac cctcatcgga 
1261 caacctgcag gtgcctgagg aggagctggg gatggtggac ttggagagga aagccgacaa 
1321 gctgcaggcc cagaagagcg tggacgatgg aggcagtttt ccaaaaagct tctccaagct 
1381 tcccatccag ctagagtcag ccgtggacac agctaagact tctgacgtca actcctcact 
1441 gggtaagagc acggccactc tacctctgtc cttcaaggaa gccactctgg cce^agaggtt 
1501 tgctctgaag accagaagtc agatcactaa gcggaaaagg atgtccctgg tcaaggagaa 
1561 gaaagcggcc cagaccctca gtgcgatctt gcttgccttc atcatcactt ggaccccata 
1621 caacatcatg gttctggtga acaccttttg tgacagctgc atacccaaaa ccttttggaa 
1681 tctgggctac tggctgtgct acatcaacag caccgtgaac cccgtgtgct atgctctgtg 
1741 caacaaaaca ttcagaacca ctttcaagat gctgctgctg tgccagtgtg acaaaaaaaa 
1801 gaggcgcaag cagcagtacc agcagagaca gtcggtcatt tttcacaagc gcgcacccga 
1861 gcaggccttg tagaatgagg ttgtatcaat agcagtgaca aaacgacaca tea 

// 




Revised: May 2, 2000. 



Disclaimer I Write to the Help Desk 
NCBI | NLM | NIH 



http://www.ncbi.nlm.nih.gov: 80 



5/9/2000 



it' Summaiy sequence view 



rage l ol l 




99 




MJ&- 



•U3- 



Protein 

BLAST PubMed Nucleotide Protein Genome Structure PopSet Taxonomy Help 
collagen alpha 3(IV) chain precursor, long splice form - human 



Views: 
GenBank view 
FASTA view 
ASN.1 view 
Feature table 



Total Residues Sequenced: 1670 aa 
Completed: Oct 28, 1994. 



CDS with gene 
and mRNA 

gene, tRNA, 
promoter... 

Other features 



1670 
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•disulfide 'disulfide 'disulfide 
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disulfide^ - "disulfide 
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disulfide" ^—disulfide 
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1 - 1800-2088- 



Organism: Homo sapiens 
Genetic Code: i 

Lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; 
Primates; Catarrhini; Hominidae; Homo. 



Sequence and localization of a partial cDNA encoding the human alpha 3 chain of type 
IV collagen 

Morrison,K.E., Mariyama,M., Yang-Feng,T.L. and Reeders,S.T. 

Am. J. Hum. Genet. 49 (3), 545-554 (1991) 

91353570 

Molecular cloning of the human Goodpasture antigen demonstrates it to be the alpha 3 
chain of type IV collagen 

Turner,N., Mason,P.J., Brown,R., Fox,M., Povey,S., Rees,A. and Pusey,C.D. 

J. Clin. Invest. 89 (2), 592-601 (1992) 

92147878 

Exon/intron structure of the human alpha 3(IV) gene encompassing the Goodpasture 
antigen (alpha 3(IV)NC1). Identification of a potentially antigenic region at the triple 
helix/NCI domain junction 

Quinones,S., Bernal,D., Garcia-Sogo,M., Elena,S.F. and Saus,J. 

J. Biol. Chem. 267 (28), 19780-19784 (1992) 

93015826 

The human mRNA encoding the Goodpasture antigen is alternatively spliced 

Bernal,D., Quinones,S. and Saus,J. 

J. Biol. Chem. 268 (16), 12090-12094 (1993) 

93280184 

Exon/intron structure f the human alpha 3(1 V) gene encompassing the Goodpasture 
antigen (alpha 3(IV)NC1). Identification of a potentially antigenic region at the triple 
helix/NCI domain junction 
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Quinones,S., Bernal,D., Garcia-Sogo,M, Elena,S.F. and Saus,J. 

J. Biol. Chem. 269 (25), 17358 (1994) 

94274734 

Complete primary structure of the human alpha 3(IV) collagen chain. Coexpression of 
the alpha 3(IV) and alpha 4(IV) collagen chains in human tissues 

Mariyama,M., Leinonen,A., Mochizuki,T., Tryggvason,K. and Reeders,S.T. 

J. Biol. Chem. 269 (37), 23013-23017 (1994) 

94364994 

COMMENT Prolines and lysines at the third position of the tripeptide repeating unit (G-X- 
Y) are hydroxylated to varying extents. Prolines are predominately 4-hydroxylated. Lysines 
are 5-hydroxylated and subsequently O-glycosylated. In Goodpasture's syndrome, an 
autoimmune response develops against an epitope in the carboxyl-terminal nonhelical NC 1 
domain. This minor type IV collagen is thought to form a heterotrimer of two alpha 3(IV) 
chains and one alpha 4(1 V) chain (see PIR:CGHU1B). A polymeric network forms with 
tetrameric associations among trimer amino- terminal domains (with disulfide and desmosine 
cross-links), dimeric associations among trimer carboxyl-terminal domains (with disulfide 
bonds), and both intra-trimer and inter-trimer associations in the interrupted helical domain 
(with disulfide and desmosine cross-links). 

Comments and suggestions to: finfo(qlncbi.nlm.mh.govJ 
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1 : CGHU3B . collagen alpha 3(I...[gi: 1360672] 



PubMed, Related Sequences 



LOCUS 

DEFINITION 

ACCESSION 

PID 

VERSION 
DBSOURCE 



KEYWORDS 



SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
REMARK 

REFERENCE 
AUTHORS 



CGHU3B 1670 aa PRI 22-JUN-1999 

collagen alpha 3 (IV) chain precursor, long splice form - human. 
CGHU3B 
g!360672 

CGHU3B GI: 1360672 
pir: locus CGHU3B; 

summary: #length 1670 #molecular-weight 161725 tchecksum 818; 
genetic: #gene GDB:C0L4A3 ##cross-ref erences GDB: 128351; 
OMIM: 120070 #map_position 2q36-2q37 #introns 1385/1; 1418/1; 
1488/1; 1547/2; 1585/3; 1643/2 #note the alpha 3 (IV) and alpha 
4 (IV) chain genes are encoded on opposite strands with overlapping 
promotor regions; defects in this gene can result in recessive for 
Alport's syndrome #status incomplete; 
superfamily: collagen alpha 1(IV) chain; 

PIR dates: 28-Oct-1994 #sequence_revision 03~Oct-1995 #text change 
22-Jun-1999. 

alternative splicing; basement membrane; cell binding; coiled coil 
extracellular matrix; glycoprotein; hydroxylysine; hydroxyproline; 
trimer; triple helix, 
human . 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 
Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (residues 1 to 1670) 

Morrison, K.E. , Mariyama,M., Yang-Feng, T . L . and Reeders,S.T. 

Sequence and localization of a partial cDNA encoding the human 

alpha 3 chain of type IV collagen 

Am. J. Hum. Genet. 49 (3), 545-554 (1991) 

91353570 

2 (residues 1 to 1670) 

Turner,N., Mason, P.J. , Brown, R. , Fox,M., Povey,S., Rees,A. and 
Pusey, C. D. 

Molecular cloning of the human Goodpasture antigen demonstrates it 
to be the alpha 3 chain of type IV collagen 
J. Clin. Invest. 89 (2), 592-601 (1992) 
92147878 

3 (residues 1 to 1670) 

Quinones,S., Bernal,D., Garcia-Sogo, M. , Elena, S.F. andSaus,J. 
Exon/intron structure of the human alpha 3 (IV) gene encompassing 
the Goodpasture antigen (alpha 3 (IV) NCI). Identification of a 
potentially antigenic region at the triple helix/NCI domain 
junction , 
J. Biol. Chem. 267 (28), 19780-19784 (1992) 
93015826 

Erratum: [ [published erratum appears in J Biol Chem 1994 Jun 
24;269(25) :17358] ] 

4 (residues 1 to 1670) 
Bernal,D., Quinones,S. and Saus,J. 
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TITLE 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
REMARK 
REFERENCE 
AUTHORS 

TITLE 



JOURNAL 
MEDLINE 
COMMENT 



FEATURES 

source 



Region 
Protein 

Region 

Region 

Bond 

Bond 

Bond 

Bond 



The human mRNA encoding the Goodpasture antigen is alternatively 
spliced 

J. Biol. Chem. 268 (16), 12090-12094 (1993) 
93280184 

5 (residues 1 to 1670) 

Quinones,S., Bernal,D., Garcia-Sogo, M. , Elena, S.F. and Saus,J. 
Exon/intron structure of the human alpha 3 (IV) gene encompassing 
the Goodpasture antigen (alpha 3(IV)NC1). Identification of a 
potentially antigenic region at the triple helix/NCI domain 
junction 

J. Biol. Chem. 269 (25), 17358 (1994) 
94274734 

annotation; erratum; correction to intronic sequence in A44043 

6 (residues 1 to 1670) 

Mariyama,M., Leinonen,A., Mochizuki, T . , Tryggvason, K. and 
Reeders, S.T. 

Complete primary structure of the human alpha 3 (IV) collagen chain 
Coexpression of the alpha 3 (IV) and alpha 4 (IV) collagen chains in 
human tissues 

J. Biol. Chem. 269 (37), 23013-23017 (1994) 
94364994 

Prolines and lysines at the third position of the tripeptide 
repeating unit (G-X-Y) are hydroxylated to varying extents. 
Prolines are predominately 4 -hydroxylated. Lysines are 
5-hydroxylated and subsequently O-glycosylated. 

In Goodpasture's syndrome, an autoimmune response develops against 
an epitope in the carboxyl-terminal nonhelical NCI domain. 
This minor type IV collagen is thought to form a heterotrimer of 
two alpha 3(IV) chains and one alpha 4 (IV) chain (see PIR:CGHU1B) . 
A polymeric network forms with tetrameric associations among trime 
amino-terminal domains (with disulfide and desmosine cross-links), 
dimeric associations among trimer carboxyl-terminal domains (with 
disulfide bonds), and both intra-trimer and inter-trimer 
associations in the interrupted helical domain (with disulfide and 
desmosine cross-links) . 

Location /Qualifiers 

1. .1670 

/organism="Homo sapiens" 
/db_xref="taxon: 9606" 
1. .28 

/region_name= "domain" 
/note="signal sequence" 
1. .1670 

/product="collagen alpha 3 (IV) chain precursor, long 
splice form" 

/note="Goodpasture antigen; procollagen alpha 3 (IV) chain 
long splice form" 
29. .1670 

/region_name= "product " 

/note="collagen alpha 3 (IV) chain, long splice form" 
29. .42 

/ region_name= "doma in " 

/note="amino-terminal nonhelical, NH1 #label NH1 " 
bond (31) 

/bond_type= "disulfide" 
/not e=" interchain" 
bond (33) 

/bond_type="disulf ide" 
/not e=" interchain" 
bond(39) 

/bond_type= "disulfide " 
/note=" interchain" 
bond(41) 

/bond_type="disulf ide " 
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/note="interchain" 
Region 43.. 1438 

/region_name= "region" 

/note="interrupted helical" 
Bond bond (125) 

/bond_type= "disulfide" 

/note="interchain" 
Site 253 

/site_type="binding" 

/note="carbohydrate (Asn) (covalent) " 
Bond bond (4 22) 

/bond_type="disulf ide" 

/note=" interchain" 
Bond bond (4 76) 

/bond_type="disulf ide " 

/note="interchain" 
Bond bond (479) 

/bond_t ype= " di sul f ide " 

/note="interchain" 
Bond bond (682) 

/bond_type="disulfide" 

/note="interchain" 
Bond bond (722) 

/bond_type="disulf ide" 

/note=" interchain" 
Region 791.. 793 

/region_name= "region" 

/note="cell attachment (R-G-D) motif" 
Bond bond (809) 

/bond_type="disulf ide " 

/note="interchain" 
Region 996.. 998 

/region_name=" region" 

/note="cell attachment (R-G-D) motif" 
Region 1154.. 1156 

/region_name=" region" 

/note="cell attachment (R-G-D) motif" 
Region 1306.. 1308 

/region_name=" region" 

/note="cell attachment (R-G-D) motif" 
Region 1345.. 1347 

/region_name= "region" 

/note="cell attachment (R-G-D) motif" 
Bond bond (1387) 

/bond_type= "disulfide" 

/not e=" interchain" 
Region 1432.. 1434 

/region_name=" region" 

/note="cell attachment (R-G-D) motif" 
Region 1439.. 1670 

/ r e g i on_name= " doma in" 

/note="carboxyl-terminal nonhelical, NCI #label NCI" 
Region 1451.. 1551 

/ r e gi on_name= " doma in" 

/note="collagen IV carboxyl-terminal repeat #label CT1" 
Bond bond(1460, 1548) 

/bond_t ype= "disulfide" 

/note="(or 1460-1551, 1493-1548)" 
Bond bond(1493, 1551) 

/bond_t ype= " di sul f i de " 

/note="(or 1460-1551, 1493-1548)" 
Bond bond(1505, 1511) 

/bond_type= "disulfide" 
Region 1561.. 1665 
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Bond 



Bond 



Bond 
ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 

// 



/region_name=" domain" 

/note="collagen IV carboxyl-terminal repeat #label CT2" 

bond<1570, 1662) 

/bond_t ype= " di sul f i de " 

/note="(or 1570-1665, 1604-1662)" 

bond(1604, 1665) 

/bond_type= "disulfide" 

/note="(or 1570-1665, 1604-1662)" 

bond(1616, 1622) 

/bond_type="disulf ide" 



msartaprpq 
gqkgftgpeg 
gvpgcsgskg 
pqglpgppgf 
pgtvivtltg 
kdgvpgfpgs 
ppgprgargp 
gcagspglpg 
yipgppglpg 
getlqpegqv 
pgspgspgpa 
pgppghpgpq 
gcpgkmgepg 
tpgnegldgp 
pgipgldrsg 
gppgppgnpg 
gf agnpgekg 
grnpgskgkrg 
kkgemgqpgp 
lpgfpgspgp 
glpgrkgamg 
shvigikgdk 
kgnpgf lgsi 
pgnlgpcgpr 
wttrgf vf tr 
pf If cnvndv 
hsqttdippc 
tcnyysnsys 



vlllplllvl 

ltgpqgpkgf 

eqgfpglpgt 

pgpvgppgpp 

pdnrtdlkge 

egvkgnrgfp 

qgpsgppgvp 

spgppgppgd 

lpglhgvkgi 

gvpgdpglrg 

gpagppgygp 

gppgipgslg 

lpgkpglpga 

rgdpgqpgpp 

fpgetgspgi 

tpghrgspgi 

nrgvpgmpgl 

tlgfpgragr 

pghlgpagpe 

mgirgdqgrd 

dagprgptgi 

gsmghpgpkg 

gppgpigpkg 

gkpgkdgkpg 

hsqttaipsc 

cnf asrndys 

phgwislwkg 

f wlaslnper 



laaapaaskg 

pglpgltgsk 

lgypgipgaa 

gf fgfpgamg 

kgdkgamgep 

glmgedgikg 

gspgssrpgl 

igfrkgppgd 

pgrqgaaglk 

qpgrkgldgi 

qgepglqgtq 

kcgdpglpgp 

kgepavampg 

geqgppgrci 

pghqgemgpl 

pgvkgqrgtp 

kglkglpgpa 

pglpgihglq 

gapgspgspg 

gipgpagekg 

egfpgppglp 

ppgtagdmgp 

ppgvrgdpgt 

tpgpagekgn 

pegtvplysg 

ywlstpalmp 

f sf imf tsag 

mf rkpipstv 



cvckdkgqcf 

gvrgisglpg 

glkgqkgapa 

prgpkghmge 

gppgpsglpg 

qkgdigppgf 

rgapgwpgli 

hglpgylgsp 

gspgspgntg 

pgtpgvkglp 

gvpgapgppg 

dgepgipgig 

gpgtpgfpge 

egprgaqglp 

gqrgypgnpg 

gakgeqgdkg 

gppgprgdlg 

gdkgepgyse 

lpgkpgphgd 

etgllrappg 

gaiipgqtgn 

pgrlgapgtp 

lkiislpgsp 

kgskgepgpa 

f sf If vqgnq 

mnmapitgra 

segtgqalas 

kagelekiis 



cdgakgekge 

f sgspglpgt 

keedieldak 

rvighkgerg 

esygsekgap 

rgpteyydty 

gskgergrpg 

gipgvdgpkg 

Ipgfpgfpga 

gpkgelalsg 

eagprgelsv 

fpgppgpkgd 

rgnsgehgei 

glnglkgqqg 

ilgppgedgv 

npgpseishv 

stgnpgepgl 

gtrpgppgpt 

lgf kgikgll 

prgnpgaqga 

rgppgsrgsp 

glpgprgdpg 

gppgtpgepg 

gsdglpglkg 

rahgqdlgtl 

lepyisrctv 

pgscleef ra 

rcqvcmkkrh 



kgfpgppgsp 

pgntgpyglv 

gdpglpgapg 

vkgltgppgp 

gdpglqgkpg 

qekgdegtpg 

kdamgtpgsp 

epgllctqcp 

qgdpglkgek 

ekgdqgppgd 

stpvpgppgp 

qgfpgtkgsl 

glpglpglpg 

rrgktgpkgd 

igmmgfpgai 

igdkgepglk 

lgipgsmgnm 

gdpglpgdmg 

gppgirgppg 

kgdrgapgfp 

gapgppgppg 

f qgfpgvkge 

mqgepgppgp 

krgdsgspat 

gsclqrf ttm 

cegpaiaiav 

spf lechgrg 
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BLAST PubMed Nucleotide Protein Genome Structure PopSet Taxonomy Help 
Ku autoantigen p70 subunit [human, mRNA, 2123 nt] 



Views: 
GenBank view 
FASTA view 
ASN.1 view 
Coding Regions 
Feature table 



Accession: S3 8729 

Total Bases Sequenced: 2123 bp 

Completed: May 8, 1993. 



-i 1 1 1 r 1 1 | 



CDS with gene 
and mRNA 

gene, tRNA, 
promoter... 

Other features 
■ ' ~2123 



%Ku autoantigen p70 subuni 



Legend: 
— - CDS 



■ - gene 



r 



1 ' ■ 1 ' 1 - 1 089-2080-. 



Links: 

Organism: Homo sapiens 
LocusLink Genetic Code: \ 

Lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; 
Primates; Catarrhini; Hominidae; Homo. 



Nucleotide sequence and genomic structure analyses of the p70 subunit of the human 
Ku autoantigen: evidence for a family of genes encoding Ku (p70)- related polypeptides 

Griffith,A.J., Craft,J., Evans,J., Mimori,T. and Hardin,J.A. 

Mol. Biol. Rep. 16 (2), 91-97 (1992) 

92301477 



Comments and suggestions to: finfofalncbi.nlm.nikgov] 
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1 : S38729 . Ku autoantigen p70...[gi:250496] 



PubMed, Protein, Related Sequences, LinkOut 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
REMARK 



FEATURES 

source 



S38729 2123 bp mRNA 

Ku autoantigen p70 subunit [human, 

S38729 

S38729.1 GI:250496 



PRI 

mRNA, 2123 nt] 



08-MAY-1993 



gene 



CDS 



human . 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 
Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 2123) 

Griffith, A. J. , Craft, J., Evans, J., Mimori,T. and Hardin, J. A. 
Nucleotide sequence and genomic structure analyses of the p70 
subunit of the human Ku autoantigen: evidence for a family of gene 
encoding Ku (p70 ) -related polypeptides 
Mol. Biol. Rep. 16 (2), 91-97 (1992) 
92301477 

GenBank staff at the National Library of Medicine created this 
entry [NCBI gibbsq 107205] from the original journal article. 
This sequence comes from Fig. 3. 

Location/Qualifiers 

1. .2123 

/organism="Homo sapiens" 
/db_xref="taxon: 9606" 
25. . 1854 

/gene="Ku autoantigen p70 subunit" 
25. .1854 

/gene="Ku autoantigen p70 subunit" 
/note="This sequence comes from Fig. 3" 
/codon_start=l 

/product="Ku autoantigen p70 subunit" 
/prot ein_id=" AAB2238 1 . 1 " 
/db_xref="GI:2504 97" 

/translation="MSGWESYYKTEGDEEAEEEQEENLEASGDYKYSGRDSLIFLVD 
SKAMFESQSEDELTPFDMSIQCIQSVYISKIISSDRDLLAVVFYGTEKDKNSVNFKN 
YVLQELDNPGAKRILELDQFKGQQGQKRFQDMMGHGSDYSLSEVLWVCANLFSDVQF 
MSHKRIMLFTNEDNPHGNDSAKASRARTKAGDLRDTGI FLDLMHLKKPGGFDI SLFY 
DIISIAEDEDLRVHFEESSKLEDLLRKVRAKETRKRALSRLKLKLNKDIVISVGIYN 
VQKALKPPPIKLYRETNEPVKTKTRTFNTSTGGLLLPSDTKRSQIYGSRQIILEKEE 
EELKRFDDPGLMLMGFKPLVLLKKHHYLRPSLFVYPEESLVIGSSTLFSALLIKCLE 
EVAALCRYTPRRNIPPYFVALVPQEEELDDQKIQVTPPGFQLVFLPFADDKRKMPFT 
KIMATPEQVGKMKAIVEKLRFTYRSDSFENPVLQQHFRNLEALALDLMEPEQAVDLT 
PKVEAMNKRLGSLVDEFKELVYPPDYNPEGKVTKRKHDNEGSGSKRPKVEYSEEELK 
HISKGTLGKFTVPMLKEACRAYGLKSGLKKQELLEALTKHFQD" 
455 c 549 g 484 t 



635 a 



BASE COUNT 
ORIGIN 

1 gcgccaaagt gagcagtagc caacatgtca gggtgggagt catattacaa aaccgagggc 
61 gatgaagaag cagaggaaga acaagaagag aaccttgaag caagtggaga ctataaatat 
121 tcaggaagag atagtttgat ttttttggtt gatgcctcca aggctatgtt tgaatctcag 
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181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 



agtgaagatg 

agtaagatca 

gacaaaaatt 

gcaaaacgaa 

gacatgatgg 

ctctttagtg 

gacaaccccc 

ctccgagata 

atatccttgt 

tttgaggaat 

aagcgagcac 

atttataatc 

aatgaaccag 

cctagcgata 

gaaacagaag 

ttggtactgc 

tcgctggtga 

gaggttgcag 

ttggtgccac 

cagctggtct 

atggcaactc 

tacagaagtg 

ttggccttgg 

gcaatgaata 

gattacaatc 

aaaaggccca 

ctgggcaagt 

ggtctgaaga 

gccgcgcgtc 

gttaaaatgt 

tgtttttgag 

actttactgc 

aaaaaaaaaa 



agttgacacc 

taagcagtga 

cagtgaattt 

ttctagagct 

gccacggatc 

atgtccaatt 

atggcaatga 

caggcatctt 

tctacagaga 

ccagcaagct 

tcagcaggtt 

tggtccagaa 

tgaaaaccaa 

ccaagaggtc 

agctaaaacg 

tgaagaaaca 

ttgggagctc 

cattgtgcag 

aggaagaaga 

ttttaccctt 

cagagcaggt 

acagctttga 

atttgatgga 

aaagactggg 

ctgaagggaa 

aggtggagta 

tcactgtgcc 

agcaggagct 

cagctgccct 

gtttctcctg 

gctttctgtt 

ctgaataaag 

aaaaaaaaaa 



ttttgacatg 

tcgagatctc 

taaaaatatt 

tgaccagttt 

tgactactca 

caagatgagt 

cagtgccaaa 

ccttgacttg 

tatcatcagc 

agaagacctg 

aaagctgaag 

ggctctcaag 

gacccggacc 

tcagatctat 

gtttgatgat 

ccattacctg 

aaccctgttc 

atacacaccc 

gttggatgac 

tgctgatgat 

gggcaagatg 

gaaccccgtg 

gccggaacaa 

ctccttggtg 

agttaccaag 

ttcagaagag 

catgctgaaa 

gctggaagcc 

tccgcagtgt 

agctaggaag 

gccatggtga 

agccctaagt 

aaa 



agcatccagt 

ttggctgtgg 

tacgtcttac 

aaggggcagc 

ctcagtgaag 

cataagagga 

gccagccggg 

atgcacctga 

atagcagagg 

ttgcggaagg 

ctcaacaaag 

cctcctccaa 

tttaatacaa 

gggagtcgtc 

ccaggtttga 

aggccctccc 

agtgctctgc 

cgcaggaaca 

cagaaaattc 

aaaaggaaga 

aaggctatcg 

ctgcagcagc 

gcagtggacc 

gatgagttta 

agaaaacacg 

gagctgaaga 

gaggcctgcc 

ctcaccaagc 

ggccaggctg 

agtctacccg 

tggtgtagcc 

ttgtactaaa 



gtatccaaag 

tgttctatgg 

aggagctgga 

agggacaaaa 

tgctgtgggt 

tcatgctgtt 

ccaggaccaa 

agaaacctgg 

atgaggacct 

ttcgcgccaa 

atatagtgat 

taaagctcta 

gtacaggcgg 

agattatact 

tgctcatggg 

tgttcgtgta 

tcatcaagtg 

tccctcctta 

aggtgactcc 

tgccctttac 

ttgagaagct 

acttcaggaa 

tgacattgcc 

aggagcttgt 

ataatgaagg 

cccacatcag 

gggcttacgg 

acttccagga 

cctggccttg 

acataagtcg 

ctcccacttt 

aaaaaaaaaa 



tgtgtacatc 

taccgagaaa 

taatccaggt 

acgtttccaa 

ctgtgccaac 

caccaatgaa 

agccggtgat 

gggctttgac 

cagggttcac 

ggagaccagg 

ctctgtgggc 

tcgggaaaca 

tttgcttctg 

ggagaaagag 

tttcaagccg 

cccagaggag 

tctggagaag 

ttttgtggct 

tccaggcttc 

tgaaaaaatc 

tcgcttcaca 

cctggaggcc 

caaggttgaa 

ttacccacca 

ttctggaagc 

caagggtacg 

gctgaagagt 

ctgaccagag 

tcctcagcca 

agggacttta 

gctgttcctt 

aaaaaaaaaa 



// 



GenBank 



Revised: May 2, 2000. 



Disclaimer | Write to the Help Desk 
NCB1 | NLM | NIH 



http://www.ncbi. nlm.nih.gov:80 



5/9/2000 



Summary sequence view 



Page 1 of 1 




C AC A OA* 




BLAST PubMed Nucleotide Protein Genome Structure PopSet Taxonomy Help 

Homo sapiens myelin-associated ligodendrocyte basic protein 
(MOBP) mRNA 



Views: 
GenBank view 
FASTA view 
ASN.1 view 
Coding Regions 
Feature table 



Accession: NM 006501 
Total Bases Sequenced: 915 bp 
Completed: Aug 10, 1999. 



CDS with gene 
and mRNA 

gene, tRNA, 
promoter... 

Other features 
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Legend: 
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Organism: Homo sapiens 
Genetic Code: \ 

Lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; 
Primates; Catarrhini; Hominidae; Homo. 



Cloning and expression of myelin-associated oligodendrocyte basic protein. A novel 
basic protein constituting the central nervous system myelin 

Yamamoto,Y., Mizuno,R., NishimuraJ., Ogawa,Y., Yoshikawa,H., Fujimura,H., Adachi,E., 
Kishimoto,T., Yanagihara,T. and Sakoda,S. 
J. Biol. Chem. 269 (50), 31725-31730 (1994) 
95081123 

COMMENT REFSEQ: This reference sequence was derived from D28 113.1. 
PROVISIONAL RefSeq: This is a provisional reference sequence record that has not yet 
been subject to human review. The final curated reference sequence record may be 
somewhat different from this one. 



Comments and suggestions to: [info(a),ncbi.nlm.nih.gov] 
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1 : NM 006501 . Homo sapiens myeli...[gi:5729930] PubMed, Protein, Related Sequences, LinkOut 



LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



NM_006501 915 bp mRNA PRI 10-AUG-1999 

Homo sapiens myelin-associated oligodendrocyte basic protein (MOBP 
mRNA. 
NM_006501 

NM 006501.1 GI.-5729930 



REFERENCE 
AUTHORS 

TITLE 



JOURNAL 
MEDLINE 
COMMENT 



FEATURES 

source 



human. 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 
Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (sites) 

YamamotO/Y., Mizuno,R., Nishimura, T . , Ogawa,Y., Yoshikawa, H . , 
Fujimura,H., Adachi,E., Kishimoto, T. , Yanagihara, T . and Sakoda,S. 
Cloning and expression of myelin-associated oligodendrocytic basic 
protein. A novel basic protein constituting the central nervous 
system myelin 

J. Biol. Chem. 269 (50), 31725-31730 (1994) 
95081123 

REFSEQ: This reference sequence was derived from D28113.1. 
PROVISIONAL RefSeq: This is a provisional reference sequence recor 
that has not yet been subject to human review. The final curated 
reference sequence record may be somewhat different from this one. 
Location/Qualifiers 
1. .915 

/organism="Homo sapiens" 
/db_xref="taxon: 9606" 
/clone="hOPRPl" 
/tissue_type="spinal cord" 
1. .915 

/gene="MOBP" 
/db_xref="LocusID: 4336" 
/db_xref="MIM: 600948 » 

9. .557 
/gene="MOBP" 
/codon_start=l 
/db_xref="LocusID: 4336 " 
/db_xref="MIM: 600948 " 

/product="myelin-associated oligodendrocyte basic protein 
/protein_id=" NP_0064 92.1 " 
/db_xref ="GI : 5729931" 

/translation-"MSQKPAKEGPRLSKNQKYSEHFSIHCCPPFTFLNSKKEIVDRK 
SICKSGCFYQKKEEDWICCACQKTRTSRRAKSPQRPKQQPAAPPAVVRAPAKPRSPP 
SERQPRSPPRSERQPRSPPRSERQPRSPPRSERQPRPRPEVRPPPAKQRPPQKSKQQ 
RSSPLRGPGASRGGSPVKASRF" 

BASE COUNT 219 a 291 c 235 g 170 t 

ORIGIN 

1 acagtgagat gagtcagaaa ccggccaagg agggtcccag actctccaaa aaccagaagt 
61 actccgaaca cttcagcata cactgctgcc cgccgttcac cttcctcaat tccaagaagg 



gene 



CDS 
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121 agatagtgga 
181 aggactggat 
241 agaggcccaa 
301 ggtcccctcc 
361 ggtcccctcc 
421 gtccccgccc 
481 aacagccgcg 
541 aagcttctag 
601 ggaggagttt 
661 tacccctgga 
721 ggcaactgct 
781 tactttgaag 
841 tgtcagtctg 
901 ctccagggac 



tcggaaatac 
ctgctgcgcc 
gcaacagcca 
gaggtctgag 
gaggtctgag 
agaggtccga 
cagcagcccc 
gttctgattg 
aaactgaatg 
tgaagttatc 
gatgctcatg 
atgtcatccc 
cttatccttc 
atgtg 



agcatctgta 
tgccagaaga 
gctgcgcccc 
cgtcagccac 
cgtcagccac 
ccaccgccag 
ctcagagggc 
aaaaggagga 
aacaacctcg 
tggcttcaaa 
gtccccatgg 
attgtcttct 
cacggactca 



agagcggctg 
ccagaaccag 
ccgcggtggt 
ggtcccctcc 
ggtcccctcc 
ccaagcagcg 
caggcgccag 
tcaggccaac 
gctcctggac 
tattatgcag 
catgggggcc 
gacctctata 
agtttcttca 



cttctaccag 
ccgccgtgcc 
cagagcgcca 
gaggtctgag 
gaggtctgag 
tccccctcag 
ccgtgggggg 
cccaaagaag 
tcattgcttc 
gggcaaacac 
tcagggcagc 
attgccactg 
atctgaagat 



aagaaagagg 
aagtcccctc 
gccaagccac 
cgtcagccac 
cgtcagccac 
aagtccaagc 
tcccccgtca 
aagtgaccaa 
acaacccatc 
ctgctgatgt 
ctgcctggag 
agagatctgc 
acatgtcttt 
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BLAST PubMed Nucleotide Protein Genome Structure PopSet Taxonomy Help 
Human mRNA for thyroglobulin 



Views: 
GenBank view 
FASTA view 
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Accession: X05615 

Total Bases Sequenced: 8448 bp 

Completed: Apr 2, 1988. 



CDS with gene 
and mRNA 

gene, tRNA, 
promoter... 

Other features 



1 1 

















8448 
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Legend: 

•a*"*" - CDS — - other feature 
I M, | - 1090-2089-. . . 



Organism: Homo sapiens 
Genetic Code: \ 

Lineage: Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; 
Catarrhini; Hominidae; Homo. 



Primary structure of human thyroglobulin deduced from the sequence of its 8448-base 
complementary DNA 

Malthiery,Y. and Lissitzky,S. 

Eur. J. Biochem. 165 (3), 491-498 (1987) 

87246630 

Direct Submission 

Malthiery,Y. 

Submitted (07-APR-1988) to the EMBL/GenBank/DDBJ databases 

A major human thyroglobulin epitope defined with monoclonal antibodies is mainly 
recognized by human autoantibodies 

Henry,M., Zanelli,E., Piechaczyk,M, Pau,B. and Malthiery,Y. 

Eur. J. Immunol. 22 (2), 315-319 (1992) 

92164705 

COMMENT patient); Data kindly reviewed (07-APR- 1988) by Malthiery Y. 
Comments and suggestions to: [infofyncbLnlm.nih.zov] 
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1 : X05615 . Human mRNA for thy...[gi:37173] 
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DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 
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REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

MEDLINE 
REFERENCE 

AUTHORS 

TITLE 

JOURNAL 
REFERENCE 

AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
COMMENT 

FEATURES 

source 



CDS 



HSTHYRR 8448 bp mRNA PRI 17-FEB-1997 

Human mRNA for thyroglobulin . 

X05615 

X05615.1 GI:37173 
thyroglobulin . 
human. 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; 
Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 8446) 
Malthiery,Y. and Lissitzky,S. 

Primary structure of human thyroglobulin deduced from the sequence 

of its 84 4 8-base complementary DNA 

Eur. J. Biochem. 165 (3), 491-498 (1987) 

87246630 

2 (bases 1 to 8448) 
Malthiery, Y . 
Direct Submission 

Submitted ( 07-APR-l 988 ) to the EMBL/GenBank/DDBJ databases 

3 (bases 1 to 8448) 

Henry, M., Zanelli,E., Piechaczyk, M. , Pau,B. and Malthiery, Y. 
A major human thyroglobulin epitope defined with monoclonal 
antibodies is mainly recognized by human autoantibodies 
Eur. J. Immunol. 22 (2), 315-319 (1992) 
92164705 
patient) ; 

Data kindly reviewed (07-APR-1988 ) by Malthiery Y. 
Location/Qualifiers 
1. .8448 

/organism="Homo sapiens" 
/db_xref="taxon: 9606" 
/clone="Ml-M5 and B1-B4" 

/tissue_type="thyroid (obtained surgically from a 

Graves 1 disease" 

42. .8345 

/ codon_start=l 

/product = "thyroglobulin" 

/protein_id=" CAA29104 .1 " 

/db_xref="GI: 37174" 

/db__xref="SWISS-PROT:P01266" 

/ translation="MALVLEIFTLLASICWVSANIFEYQVDAQPLRPCELQRETAFL 
QADYVPQCAEDGSFQTVQCQNDGRSCWCVGANGSEVLGSRQPGRPVACLSFCQLQKQ 
ILLSGYINSTDTSYLPQCQDSGDYAPVQCDVQHVQCWCVDAEGMEVYGTRQLGRPKR 
PRSCEIRNRRLLHGVGDKSPPQCSAEGEFMPVQCKFVNTTDMMIFDLVHSYNRFPDA 
VTFSSFQRRFPEVSGYCHCADSQGRELAETGLELLLDEI YDTIFAGLDLPSTFTETT 
YRILQRRFLAVQSVISGRFRCPTKCEVERFTATSFGHPYVPSCRRNGDYQAVQCQTE 
PCWCVDAQGKEMHGTRQQGEPPSCAEGQSCASERQQALSRLYFGTSGYFSQHDLFSS 
EKRWASPRVARFATSCPPTIKELFVDSGLLRPMVEGQSQQFSVSENLLKEAIRAIFP 
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RGLARLALQFTTNPKRLQQNLFGGKFLVNVGQFNLSGALGTRGTFNFSQFFQQLGLA 

FLNGGRQEDLAKPLSVGLDSNSSTGTPEAAKKDGTMNKPTVGSFGFEINLQENQNAL 

FLASLLELPEFLLFLQHAISVPEDVARDLGDVMETVLDSQTCEQTPERLFVPSCTTE 

SYEDVQCFSGECWCVNSWGKELPGSRVRDGQPRCPTDCEKQRARMQSLMGSQPAGST 

FVPACTSEGHFLPVQC FNSECYCVDAEGQAIPGTRSAIGKPKKCPTPCQLQSEQAFL 

TVQALLSNSSMLPTLSDTYIPQCSTDGQWRQVQCNGPPEQVFELYQRWEAQNKGQDL 

PAKLLVKIMSYREAASGNFSLFIQSLYEAGQQDVFPVLSQYPSLQDVPLAALEGKRP 

PRENILLEPYLFWQILNGQLSQYPGSYSDFSTPLAHFDLRNCWCVDEAGQELEGMRS 

PSKLPTCPGSCEEAKLRVLQFIRETEEIVSASNSSRFPLGESFLVAKGIRLRNEDLG 

PPLFPPREAFAEFLRGSDYAIRLAAQSTLSFYQRRRFSPDDSAGASALLRSGPYMPQ 

DAFGSWEPVQCHAGTGHCWCVDEKGGFI PGSLTARSLQI PQCPTTCEKSRTSGLLSS 

KQARSQENPSPKDLFVPACLETGEYARLQASGAGTWCVDPASGEELRPGSSSSAQCP 

LCNVLKSGVLSRRVSPGYVPACRAEDGGFSPVQCDQAQGSCWCVMDSGEEVPGTRVT 

GQPACESPRCPLPFNASEVVGGTILCETISGPTGSAMQQCQLLCRQGSWSVFPPGPL 

CSLESGRWESQLPQPRACQRPQLWQTIQTQGHFQLQLPPGKMCSADYAGLLQTFQVF 

LDELTARGFCQIQVKT FGTLVSIPVCNNSSVQVGCLTRERLGVNVTWKSRLEDIPVA 

LPDLHDIERALVGKDLLGRFTDLIQSGSFQLHLDSKTFPAETIRFLQGDHFGTSPRT 

FGCSEGFYQVLTSEASQDGLGCVKCHEGSYSQDEECIPCPVGFYQEQAGSLACVPCP 

GRTTISAGAFSQTHCVTDCQRNEAGLQCDQNGQYRASQKDRGSGKAFCVDGEGRRLP 

WETEAPLEDSQCLMMQKFEKVPESKVIFDANAPVAVRSKVPDSEFPVMQCLTDCTED 

ACSFFTVSTTEPEISCDFYAWTSDNVACMTSDQKRDALGNSKATSFGSLRCQVKVRS 

GQDSPAVYLKKGQGSTTTLQKRFEPTGFQNMLSGLYNPIVFSASGANLTDAHLFCLL 

CDRDLCCDGFVLTQVQGGAIICGLLSSPSVLLCNVKDWMDPSEAWANATCPGVTYDQ 

SHQVILRLGDQEFIKSLTPLEGTQDTFTNFQQVYLWKDSDMGSRPESMGCRKNTVPR 

ASPTEAGLTTELFSPVDLNQVIVNGNQSLSSQKHWLFKHLFSAQQANLWCLSRCVQE 

SFCQLAEITESASLYFTCTLYPEAQVCDDIMESNTQGCRLILPQMPKALFRKKVILE 

KVKNFYTRLPFQKLMGISIRNKVPMSEKSISNGFFECERRCDADPCCTGFGFLNVSQ 

KGGEVTCLTLNSLGIQMCSEENGGAWRILDCGSPDIEVHTYPFGWYQKPIAQNNAPS 

CPLVVLPSLTEKVSLESWQSLALSSWVDPSIRHFDVAHVSTAATSNFSAVRDLCLS 

CSQHEACLI TTLQTQLGAVRCMFYADTQSCTHSLQGRNCRLLLREEATH I YRKPG I S 

LSYEASVPSVPISTHGRLLGRSQAIQVGTSWKQVDQFLGVPYAAPPLAERHFQAPEP 

NWTGSWDASKPRASCWQPGTRTSTSPGVSEDCLYLNVFIPQNVAPNASVLVFFHNTM 

REESEGWPAI DGS FLAAVGNLI VVTASYRVGVFGFLSSGSGEVSGNWGLLDQVAALT 

VQTHIRGFGGDPRRVSLAADRGGADVASIHLLTARATNSQLFRRAVLMGGSALSPAA 

ISHERAQQQAIALAKEVSCPMSSSQEWSCLRQKPANVLNDAQTKLLAVSGPFHYWG 

VIDGHFLREPPARALKRSLWVEVDLLIGSSQDDGLINRAKAVKQFEESRGRTSSKTA 

YQALQNSLGGEDSDARVEAAATWYYSLEHSTDDYASFSRALENATRDYFIICPIIDM 

SAWAKRARGNVFMYHAPENYGHGSLELLADVQFALGLPFYPAYEGQFSLEEKSLSLK 

MQYFSHFIRSGNPNYPYEFSRKVPTFATPWPDFVPRAGGENYKEFSELLPNRQGLKK 

DCSFWSKYISSLKTSADGAKGGQSAESEEEELTAGSGLREDLLSLQEPGSKTYSK" 



sia peptide 


42. .98 


mat peptide 


99. .8342 




/product="thyroglobulin" 


old sequence 


3165 




/citation=[l] 




/replace="t" 


old sequence 


3179 




/citation=[l] 




/replace-"t" 


old sequence 


3214. .3215 




/citation=[l] 




/replace-"ct" 


polyA sicrnal 


8427. .8432 


old sequence 


8433. .8434 




/citation= [1] 




/replace-"t" 


old sequence 


8447. .8448 



/citation= [1] 
/replace="t" 

BASE COUNT 1898 a 2297 c 2301 g 1952 t 

ORIGIN 

1 gcagtggttt ctcctccttc ctcccaggaa gggccaggaa aatggccctg gtcctggaga 
61 tcttcaccct gctggcctcc atctgctggg tgtcggccaa tatcttcgag taccaggttg 
121 atgcccagcc ccttcgtccc tgtgagctgc agagggaaac ggcctttctg aagcaagcag 
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181 actacgtgcc ccagtgtgca gaggatggca gcttccagac tgtccagtgc cagaacgacg 
241 gccgctcctg ctggtgtgtg ggtgccaacg gcagtgaagt gctgggcagc aggcagccag 
301 gacggcctgt ggcttgtctg tcattttgtc agctacagaa acagcagatc ttactgagtg 
361 gctacattaa cagcacagac acctcctacc tccctcagtg tcaggattca ggggactacg 
421 cgcctgttca gtgtgatgtg cagcatgtcc agtgctggtg tgtggacgca gaggggatgg 
481 aggtgtatgg gacccgccag ctggggaggc caaagcgatg tccaaggagc tgtgaaataa 
541 gaaatcgtcg tcttctccac ggggtgggag ataagtcacc accccagtgt tctgcggagg 
601 gagagtttat gcctgtccag tgcaaatttg tcaacaccac agacatgatg atttttgatc 
661 tggtccacag ctacaacagg tttccagatg catttgtgac cttcagttcc ttccagagga 
721 ggttccctga ggtatctggg tattgccact gtgctgacag ccaagggcgg gaactggctg 
781 agacaggttt ggagttgtta ctggatgaaa tttatgacac catttttgct ggcctggacc 
841 ttccttccac cttcactgaa accaccctgt accggatact gcagagacgg ttcctcgcag 
901 ttcaatcagt catctctggc agattccgat gccccacaaa atgtgaagtg gagcggttta 
961 cagcaaccag ctttggtcac ccctatgttc caagctgccg ccgaaatggc gactatcagg 
1021 cggtgcagtg ccagacggaa gggccctgct ggtgtgtgga cgcccagggg aaggaaatgc 
1081 atggaacccg gcagcaaggg gagccgccat cttgtgctga aggccaatct tgtgcctccg 
1141 aaaggcagca ggccttgtcc agactctact ttgggacctc aggctacttc agccagcacg 
1201 acctgttctc ttccccagag aaaagatggg cctctccaag agtagccaga tttgccacat 
1261 cctgcccacc cacgatcaag gagctctttg tggactctgg gcttctccgc ccaatggtgg 
1321 agggacagag ccaacagttt tctgtctcag aaaatcttct caaagaagcc atccgagcaa 
1381 tttttccctc ccgagggctg gctcgtcttg cccttcagtt taccaccaac ccaaagagac 
14 41 tccagcaaaa cctttttgga gggaaatttt tggtgaatgt tggccagttt aacttgtctg 
1501 gagcccttgg cacaagaggc acatttaact tcagtcaatt tttccagcaa cttggtcttg 
1561 caagcttctt gaatggaggg agacaagaag atttggccaa gccactctct gtgggattag 
1621 attcaaattc ttccacagga acccctgaag ctgctaagaa ggatggtact atgaataagc 
1681 caactgtggg cagctttggc tttgaaatta acctacaaga gaaccaaaat gccctcaaat 
1741 tccttgcttc tctcctggag cttccagaat tccttctctt cttgcaacat gctatctctg 
1801 tgccagaaga tgtggcaaga gatttaggtg atgtgatgga aacggtactc gactcccaga 
1861 cctgtgagca gacacctgaa aggctatttg tcccatcatg cacgacagaa ggaagctatg 
1921 aggatgtcca atgcttttcc ggagagtgct ggtgtgtgaa ttcctggggc aaagagcttc 
1981 caggctcaag agtcagagat ggacagccaa ggtgccccac agactgtgaa aagcaaaggg 
2041 ctcgcatgca aagcctcatg ggcagccagc ctgctggctc caccttgttt gtccctgctt 
2101 gtactagtga gggacatttc ctgcctgtcc agtgcttcaa ctcagagtgc tactgtgttg 
2161 atgctgaggg tcaggccatt cctggaactc gaagtgcaat agggaagccc aagaaatgcc 
2221 ccacgccctg tcaattacag tctgagcaag ctttcctcag gacggtgcag gccctgctct 
2281 ctaactccag catgctaccc accctttccg acacctacat cccacagtgc agcaccgatg 
2341 ggcagtggag acaagtgcaa tgcaatgggc ctcctgagca ggtcttcgag ttgtaccaac 
2401 gatgggaggc tcagaacaag ggccaggatc tgacgcctgc caagctgcta gtgaagatca 
2461 tgagctacag agaagcagct tccggaaact tcagtctctt tattcaaagt ctgtatgagg 
2521 ctggccagca agatgtcttc ccggtgctgt cacaataccc ttctctgcaa gatgtcccac 
2581 tagcagcact ggaagggaaa cggccccagc ccagggagaa tatcctcctg gagccctacc 
2641 tcttctggca gatcttaaat ggccaactca gccaataccc ggggtcctac tcagacttca 
2701 gcactccttt ggcacatttt gatcttcgga actgctggtg tgtggatgag gctggccaag 
2761 aactggaagg aatgcggtct gagccaagca agctcccaac gtgtcctggc tcctgtgagg 
2821 aagcaaagct ccgtgtactg cagttcatta gggaaacgga agagattgtt tcagcttcca 
2881 acagttctcg gttccctctg ggggagagtt tcctggtggc caagggaatc cggctgagga 
2941 atgaggacct cggccttcct ccgctcttcc cgccccggga ggctttcgcg gagtttctgc 
3001 gtgggagtga ttacgccatt cgcctggcgg ctcagtctac cttaagcttc tatcagagac 
3061 gccgcttttc cccggacgac tcggctggag catccgccct tctgcggtcg ggcccctaca 
3121 tgccacagtg tgatgcgttt ggaagttggg agcctgtgca gtgccacgct gggactgggc 
3181 actgctggtg tgtagatgag aaaggagggt tcatccctgg ctcactgact gcccgctctc 
3241 tgcagattcc aeagtgcccg acaacctgcg agaaatctcg aaccagtggg ctgctttcca 
3301 gttggaaaca ggctagatcc caagaaaacc catctccaaa agacctgttc gtcccagcct 
3361 gcctagaaac aggagaatat gccaggctgc aggcatcggg ggctggcacc tggtgtgtgg 
3421 accctgcatc aggagaagag ttgcggcctg gctcgagcag cagtgcccag tgcccaagcc 
3481 tctgcaatgt gctcaagagt ggagtcctct ctaggagagt cagcccaggc tatgtcccag 
3541 cctgcagggc agaggatggg ggcttttccc cagtgcaatg tgaccaggcc cagggcagct 
3601 gctggtgtgt catggacagc ggagaagagg tgcctgggac gcgcgtgacc gggggccagc 
3661 ccgcctgtga gagcccgcgg tgtccgctgc cattcaacgc gtcggaggtg gttggtggaa 
3721 caatcctgtg tgagacaatc tcgggcccca caggctctgc catgcagcag tgccaattgc 
3781 tgtgccgcca aggctcctgg agcgtgtttc caccagggcc attgatatgt agcctggaga 
3841 gcggacgctg ggagtcacag ctgcctcagc cccgggcctg ccaacggccc cagctgtggc 
3901 agaccatcca gacccaaggg cactttcagc tccagctccc gccgggcaag atgtgcagtg 
3961 ctgactacgc gggtttgctg cagactttcc aggttttcat attggatgag ctgacagccc 
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4021 gcggcttctg ccagatccag gtgaagactt 
4081 acaactcctc tgtgcaggtg ggttgtctga 
4141 ggaaatcacg gcttgaggac atcccagtgg 
4201 gagccttggt gggcaaggat ctccttgggc 
4261 tccagcttca tctggactcc aagacgttcc 
4321 accactttgg cacctctcct aggacacggt 
4381 tgacaagtga ggccagtcag gacggactgg 
4441 cccaagatga ggaatgcatt ccttgtcctg 
4501 tggcctgtgt cccatgtcct gtgggcagaa 
4561 ctcactgtgt cactgactgt cagaggaacg 
4 621 agtatcgagc cagccagaag gacaggggca 
4 681 ggcggaggct gccatggtgg gaaacagagg 
4741 tgcagaagtt tgagaaggtt ccagaatcaa 
4801 ctgtcagatc caaagttcct gattctgagt 
4861 cagaggacga ggcctgcagc ttcttcaccg 
4921 atttctatgc ttggacaagt gacaatgttg 
4 981 cactggggaa ctcaaaggcc accagctttg 
5041 gccatggtca agattctcca gctgtgtatt 
5101 ttcagaaacg ctttgaaccc actggtttcc 
5161 ttgtgttctc agcctcagga gccaatctaa 
5221 gcgaccgtga tctgtgttgc gatggcttcg 
5281 tctgtgggtt gctgagctca cccagtgtcc 
5341 cctctgaagc ctgggctaat gctacatgtc 
5401 aggtgatatt gcgtcttgga gaccaggagt 
54 61 ctcaagacac ctttaccaat tttcagcagg 
5521 ctcggcctga gtctatggga tgtagaaaaa 
5581 aagcaggttt gacaacagaa cttttctccc 
5641 gaaatcaatc actatccagc cagaagcact 
5701 aggcaaacct atggtgcctt tctcgttgtg 
5761 agataacaga gagtgcatcc ttgtacttca 
5821 gtgatgacat catggagtcc aatacccagg 
5881 aggccctgtt ccggaagaaa gttatactgg 
5941 tgccgttcca aaaactgatg gggatatcca 
6001 ctatttctaa tgggttcttt gaatgtgaac 
6061 gctttggatt tctaaatgtt tcccagttaa 
6121 acagcttggg aattcagatg tgcagtgagg 
6181 gtggctctcc tgacattgaa gtccacacct 
6241 ctcaaaataa tgctcccagt ttttgccctt 
6301 tgtctctgga atcgtggcag tccctggccc 
6361 ggcactttga tgttgcccat gtcagcactg 
6421 acctctgttt gtcggaatgt tcccaacatg 
6481 aactcggggc tgtgagatgt atgttctatg 
6541 agggtcggaa ctgccgactt ctgcttcgtg 
6601 gaatctctct gctcagctat gaggcatctg 
6661 ggctgctggg caggtcccag gccatccagg 
6721 tccttggagt tccatatgct gccccgcccc 
6781 ccttgaactg gacaggctcc tgggatgcca 
6841 gcaccagaac atccacgtct cctggagtca 
6901 tccctcagaa tgtggcccct aacgcgtctg 
6961 gggaggagag tgaaggatgg ccggctatcg 
7021 tcatcgtggt cactgccagc taccgagtgg 
7081 gagaggtgag tggcaactgg gggctgctgg 
7141 cccacatccg aggatttggc ggggaccctc 
7201 gggctgatgt ggccagcatc caccttctca 
7261 ggagagctgt gctgatggga ggctccgcac 
7321 gggctcagca gcaggcaatt gctttggcaa 
7381 aagaagtggt gtcctgcctc cgccagaagc 
7441 agctcctggc cgtgagtggc cctttccact 
7501 tccgtgagcc tccagccaga gcactgaaga 
7561 ttgggagttc tcaggacgac gggctcatca 
7621 aaagtcgagg ccggaccagt agcaaaacag 
7681 gtggcgagga ctcagatgcc cgcgtcgagg 
7741 actccacgga tgactatgcc tccttctccc 
7801 ttatcatctg ccctataatc gacatggcca 



ttggcaccct ggtttccatt cctgtctgca 
ccagggagcg tttaggagtg aatgttacat 
cttctcttcc tgacttacat gacattgaga 
gcttcacaga tctgatccag agtggctcat 
cagcggaaac catccgcttc ctccaagggg 
ttgggtgctc ggaaggattc taccaagtct 
gatgcgttaa gtgccatgaa ggaagctatt 
ttggattcta ccaagaacag gcagggagct 
cgaccatttc tgccggagct ttcagccaga 
aagcaggcct gcaatgtgac cagaatggcc 
gtgggaaggc cttctgtgtg gacggcgagg 
cccctcttga ggactcacag tgtttgatga 
aggtgatctt cgacgccaat gctcctgtgg 
tccccgtgat gcagtgcttg acagattgca 
tgtccacgac ggagccagag atttcctgtg 
cctgcatgac ttctgaccag aaacgagatg 
gaagtcttcg ctgccaggtg aaagtgagga 
tgaaaaaggg ccaaggatcc accacaacac 
aaaacatgct ttctggattg tacaacccca 
ccgatgctca cctcttctgt cttcttgcat 
tcctcacaca ggttcaagga ggtgccatca 
tgctttgtaa tgtcaaagac tggatggatc 
ctggtgtgac atatgaccag gagagccacc 
tcatcaagag tctgacaccc ttagaaggaa 
tttatctctg gaaagattct gacatggggt 
acacagtgcc aaggccagca tctccaacag 
ctgtggacct caaccaggtc attgtcaatg 
ggcttttcaa gcacctgttt tcagcccagc 
tgcaggagca ctctttctgt cagctcgcag 
cctgcaccct ctacccagag gcacaggtgt 
gctgcagact gatcctgcct cagatgccaa 
aagataaagt gaagaacttt tacactcgcc 
ttagaaataa agtgcccatg tctgaaaaat 
gacggtgcga tgcggaccca tgctgcactg 
aaggaggaga ggtgacatgt ctcactctga 
agaatggagg agcctggcgc attttggact 
atcccttcgg atggtaccag aagcccattg 
tggttgttct gccttccctc acagagaaag 
tctcttcagt ggttgttgat ccatccatta 
ctgccaccag caatttctct gctgtccgag 
aggcctgtct catcaccact ctgcaaaccc 
ctgatactca aagctgcaca catagtctgc 
aagaggccac ccacatctac cggaagccag 
taccttctgt gcccatttcc acccatggcc 
tgggtacctc atggaagcaa gtggaccagt 
tggcagagag gcacttccag gcaccagagc 
gcaagccaag ggccagctgc tggcagccag 
gtgaagattg tttgtatctc aatgtgttca 
tgctggtgtt cttccacaac accatggaca 
acggctcctt cttggctgct gttggcaacc 
gtgtcttcgg cttcctgagt tctggatccg 
accaggtggc ggctctgacc tgggtgcaga 
ggcgcgtgtc cctggcagca gaccgtggcg 
cggccagggc caccaactcc caacttttcc 
tctccccggc cgccgtcatc agccatgaga 
aggaggtcag ttgccccatg tcatccagcc 
ctgccaatgt cctcaatgat gcccagacca 
actggggtcc tgtgatcgat ggccacttcc 
ggtctttatg ggtagaggtc gatctgctca 
acagagcaaa ggctgtgaag caatttgagg 
ccttttacca ggcactgcag aattctctgg 
ctgctgctac atggtattac tctctggagc 
gggctctgga gaatgccacc cgggactact 
gtgcctgggc aaagagggcc cgaggaaacg 
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7861 tcttcatgta ccatgctcct gaaaactacg gccatggcag cctggagctg ctggcggatg 

7921 ttcagtttgc cttggggctt cccttctacc cagcctacga ggggcagttt tctctggagg 

7981 agaagagcct gtcgctgaaa atcatgcagt acttttccca cttcatcaga tcaggaaatc 

8041 ccaactaccc ttatgagttc tcacggaaag tacccacatt tgcaaccccc tggcctgact 

8101 ttgtaccccg tgctggtgga gagaactaca aggagttcag tgagctgctc cccaatcgac 

8161 agggcctgaa gaaagccgac tgctccttct ggtccaagta catctcgtct ctgaagacat 

8221 ctgcagatgg agccaagggc gggcagtcag cagagagtga agaggaggag ttgacggctg 

8281 gatctgggct aagagaagat ctcctaagcc tccaggaacc aggctctaag acctacagca 

8341 agtgaccagc ccttgagctc cccaaaaacc tcacccgagg ctgcccacta tggtcatctt 

8401 tttctctaaa atagttactt accttcaata aagtatctac atgcggtg 



// 



n 



■n 
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BLAST PubMed Nucleotide Protein Genome Structure PopSet Taxonomy Help 

H.sapiens mRNA for put. B7,3 m lecule of CD80-CD60 protein 
family 



Views: 
GenBank view 
FASTA view 
ASN.1 view 
Coding Regions 
Feature table 



Accession: Y07827 

Total Bases Sequenced: 1 182 bp 

Completed: Jan 8, 1997. 



CDS with gene 
and mRNA 

gene, tRNA, 
promoter... 



Other features 

l l l} l )U| l III | M II | II |ll ll| lllljlll I j l III |l ll| l )M |llll|llll | l lll| H IIJIIII| III |l ^ j 

l] " ll " 1 " T ' 1i;r ' n " w l w:TO '"" 1 " " "" " " " B7 # 3 molecule 

Legend: 

— - CDS 

- 100-200-.-. 



Organism: Homo sapiens 
Genetic Code: i 

Lineage: Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; 
Catarrhini; Hominidae; Homo. 



Cloning, structural analysis and mapping of B30 and B7 family members, to the MHC 
and other chromosomal regions. Toward the identification of the ancestral major 
histocompatability complex 

Henry ,J., Ribouchon,M.T., Depetris,D., Mattei,M.G., Offer,C, Tazi-Ahnini,R. and 

Pantarotti,P. 

Unpublished 

Direct Submission 
Pontarotti,P. 

Submitted (06-SEP-1996) P. Pontarotti, Unite 1 19 INSERM, 27 bd.Lei Roure, 13009 
Marseille, FRANCE 

Comments and suggestions to: finfo^ncblnlm.nihgov] 
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History 



Clipboard 



1 : Y07827 . H.sapiens mRNA for...[gi: 1770367] Protein - Related Sequence*, LinkOut 

LOCUS HSB73 1182 bp mRNA PRI 08-JAN-1997 

DEFINITION H.sapiens mRNA for put. B7,3 molecule of CD80-CD60 protein family. 

ACCESSION Y07827 

VERSION Y07827.1 GI: 1770367 

KEYWORDS major histocompatibility complex. 

SOURCE human. 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; 
Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 1182) 

AUTHORS Henry, J., Ribouchon, M. T . , Depetris,D., Mattei, M.G. , Offer, C, 

Tazi-Ahnini, R. and Pantarotti, P . 
TITLE Cloning, structural analysis and mapping of B30 and B7 family 

members, to the MHC and other chromosomal regions. Toward the 
identification of the ancestral major histocompatability complex 
JOURNAL Unpublished 
REFERENCE 2 (bases 1 to 1182) 
AUTHORS Pontarotti, P. 
TITLE Direct Submission 

JOURNAL Submitted ( 06-SEP-1996 ) P. Pontarotti, Unite 119 INSERM, 27 bd.Lei 
Roure, 13009 Marseille, FRANCE 
FEATURES Location/Qualifiers 
source 1 . .1182 

/organism="Homo sapiens" 
/db_xref="taxon: 9606" 
/chromosome="6" 
/map="6p23" 
CDS 97. .1149 

/note="cDNA corresponding to the full length of EST clone 
in T86577; expresssion : antigen presenting cells" 
/codon_start=l 

/product="put . B7,3 molecule of CD80-CD86 family" 

/protein_id=" CAA69164 . 1 " 

/db_xref-"GI: 1770368" 

/ db_xr e f = " S PTREMBL : Q9 9 4 2 0 " 

/translation^' MAS FLAFLLLNFRVCLLLLQLLMPHSAQFSVLGPSGPILAMVG 
DADLPCHLFPTMSAETMELKWVSSSLRQWNVYADGKEVEDRQSAPYRGRTSILRDG 
TAGKAAFRIHNVTGSDRWKYLCYFQDGDFYEKALVELKVAALGSDLHVDVKGYKDGG 
HLECRSTGWYPQPQIQWSNNKGENIPTVEAPWADGVGLYAVAASVIMRGSSGEGVS 
T I RNSLLGLEKTAS I S I ARPFFRSAQRWI AALAGTLPVLLLLLGGAGYFLWQQQEEK 
TQFRKKKREQELREMAWSTMKQEQSTRVKLLEELRWRSIQYASRGERHSAYNEWKKA 
FKPGEEMLQMRLHFVK" 

BASE COUNT 322 a 263 c 335 g 262 t 

ORIGIN 

1 ttcggcacga gagaactatt aactgccttt cttctgtggg ctgtgatttt cagaggggaa 
61 tgctaaagta tctcctgata tgcagcatga atgaaaatgg caagtttcct ggccttcctt 
121 ctgctcaact ttcgtgtctg cctccttttg cttcagctgc tcatgcctca ctcagctcag 
181 ttttctgtgc ttggaccctc tgggcccatc ctggccatgg tgggtgaaga cgctgatctg 
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241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 



// 



ccctgtcacc 
agcctaaggc 
gcaccgtatc 
ttccgaatac 
ggtgacttct 
cacgttgatg 
tggtaccccc 
gaagcacctg 
agaggcagct 
aagacagcca 
gccctggcag 
tggcaacagc 
agagaaatgg 
gaactcagat 
gaatggaaaa 
gttaaataaa 



tgttcccgac 
aggtggtgaa 
gagggagaac 
acaacgtcac 
atgaaaaagc 
tgaagggtta 
aaccccaaat 
tggttgcaga 
ctggggaggg 
gcatttccat 
ggaccctgcc 
aggaggaaaa 
catggagcac 
ggagaagtat 
aggccctctt 
atggatgaat 



catgagtgca 
cgtgtatgca 
ttcgattctg 
aggctctgac 
cctggtggag 
caaggatgga 
acagtggagc 
cggagtgggc 
tgtatcctgt 
cgcaagaccc 
tgtcttgctg 
aaagactcag 
aatgaagcaa 
ccagtatgca 
caagcctggt 
gaaaaaaaaa 



gagaccatgg 
gatggaaagg 
cgggatggca 
aggtggaagt 
ctgaaggttg 
gggatccatc 
aacaacaagg 
ctgtatgcag 
accatcagaa 
ttcttcagga 
ctgctccttg 
ttcagaaaga 
gaacaaagca 
tctcggggag 
gaggaaatgc 
aaaaaaaaaa 



agctgaagtg 
aagtggaaga 
tcactgcagg 
acctgtgtta 
cagcactggg 
tggagtgcag 
gagagaacat 
tagcagcatc 
attccctcct 
gcgcccagag 
ggggagccgg 
aaaagagaga 
caagagtgaa 
agagacattc 
ttcagatgag 
aa 



ggtaagttcc 
caggcagagt 
gaaggctgct 
tttccaagat 
ttctgatctt 
gtccactggc 
cccgactgtg 
tgtgatcatg 
cggcctggaa 
gtggatcgcc 
ttacttcctg 
gcaagagttg 
gctcctggag 
agcctataat 
gctccacttt 
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BLAST PubMed Nucleotide Protein Genome Structure PopSet Taxonomy Help 
B.verrucosa mRNA for pollen allergen Bet v 4 



C AC A 0 AC I^BPT aCOC 



Views: 
GenBank view 
FASTA view 
ASN.1 view 
Coding Regions 
Feature table 



Accession: Y 12560 

Total Bases Sequenced: 496 bp 

Completed: Apr 24, 1997. 



CDS with gene 
and mRNA 

gene, tRNA, 
promoter... 



Other features 

j — i i i i i i i i I i i i i i i i i i I i i i i i i i i i I i i i i i i i i i I i i i i i i i i 



496 



S^Bet v 4 



Legend: 

— »- CDS 
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mmmm - gene 

1 - 100-200-... 



Organism: Betula pendula 
Genetic Code: 1_ 

Lineage: Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 
euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; Rosidae; Fagales; 
Betulaceae; Betula. 



Molecular characterization, expression in Escherichia coli, and epitope analysis of a two 
EF-hand calcium-binding birch pollen allergen, Bet v 4 

Twardosz,A., Hayek,B., Seiberler,S., Vangelista,L., Elfman,L., Gronlund,H., Kraft,D. and 
Valenta,R. 

Biochem. Biophys. Res. Commun. 239 (1), 197-204 (1997) 
98005106 

Direct Submission 

Valenta,R. 

Submitted (14-APR-1997) R. Valenta, Institute of General & Experimental Pathology, 
General Hospital, Waehringer Guertel 18-20, 1090 Vienna, AUSTRIA 
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% NCBI 



PubMed 
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Genome 



Structure 



Search l Nucleotiae @ l fori 



CienBank 



Limits Preview/Index 



History 



Clipboard 



PopSet 



1 : Y12560 . B.verrucosa mRNA f...[gi:2051992] 



PubMed, Protein 



LOCUS BVBETV4PA 4 96 bp mRNA PLN 24-OCT-1997 

DEFINITION B.verrucosa mRNA for pollen allergen Bet v 4. 

ACCESSION Y12560 

VERSION Y12560.1 GI: 2051992 

KEYWORDS calcium-binding pollen allergen; EF-hand protein; pollen allergen. 
SOURCE European white birch. 

ORGANISM Betula pendula 

Eukaryota ; Viridiplant ae ; St rept ophyt a ; Embryophyt a ; Tracheophyt a ; 
euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; 
Rosidae; Fagales; Betulaceae; Betula. 
REFERENCE 1 (bases 1 to 496) 

AUTHORS Twardosz,A., Hayek, B., Seiberler, S . , Vangelista, L . , Elfman,L., 

Gronlund,H., Kraft, D. and Valenta,R. 
TITLE Molecular characterization, expression in Escherichia coli, and 

epitope analysis of a two EF-hand calcium-binding birch pollen 
allergen, Bet v 4 

Commun. 239 (1), 197-204 (1997) 



JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



gene 



CDS 



Biochem . Biophys . Res . 
98005106 

2 (bases 1 to 496) 

Valenta, R. 

Direct Submission 

Submitted ( 14-APR-1997 ) R. Valenta, Institute of General & 
Experimental Pathology, General Hospital, Waehringer Guertel 18-20 
1090 Vienna, AUSTRIA 

Location/Qualifiers 

1. .496 

/organism="Betula pendula" 
/db_xref="taxon:3505" 
/clones" 6" 

/tissue_t ype= "pollen" 
/clone_lib="lambda gtll" 
28. .285 

/gene="Bet v 4" 
28. .285 

/gene="Bet v 4" 

/note="two EF-hand calcium-binding birch pollen allergen" 

/codon_start=l 

/protein_id= n CAA73147 . 1 " 

/db_xref ="GI : 2051993" 

/ db_xr e f = " S PTREMBL : 00 4 1 3 1 " 

/translation="MADDHPQDKAERERIFKRFDANGDGKISAAELGEALKTLGSIT 
DEVKHMMAEIDTDGDGFISFQEFTDFARANRGLLKDVAKIF" 
92 c 105 g 161 t 



138 a 



BASE COUNT 
ORIGIN 

1 gaattcccaa caacagtgga aacaaaaatg gctgatgatc atccacagga caaggctgaa 

61 cgcgagcgca ttttcaagcg ctttgacgcc aatggcgatg gtaaaatctc tgcagcagag 

121 cttggggagg ccttgaaaac acttggctcc atcacaccgg atgaggtgaa acatatgatg 

181 gctgagattg acaccgatgg cgacggcttc atttcgttcc aagagttcac ggattttgct 
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241 cgtgctaatc gtggtttact aaaggatgtt gccaagatat tttaatgtct ctgtctcttt 

301 ctcttttttg gcatattttc atgtcatgat ctccttgttt agagtgattt attttcatgg 

361 ctattgtccg tttggatttt tctcattata gaattatttt tggtagttcg actgtcacct 

421 cttgtattct aatatcaatg gtgtttgacc gtattattgt aacaagaaaa tggttcatca 

481 aaaaaaaaaa aaaaaa 
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